Multivariate Functional Regression Models for Epistasis Analysis
نویسندگان
چکیده
To date, most genetic analyses of phenotypes have focused on analyzing single traits or, analyzing each phenotype independently. However, joint epistasis analysis of multiple complementary traits will increase statistical power, and hold the key to understanding the complicated genetic structure of the complex diseases. Despite their importance in uncovering the genetic structure of complex traits, the statistical methods for identifying epistasis in multiple phenotypes remains " fundamentally unexplored ". To fill this gap, we formulate a test for interaction between two gens in multiple quantitative trait analysis as a multiple functional regression (MFRG) in which the genotype functions (genetic variant profiles) are defined as a function of the genomic position of the genetic variants. We use large scale simulations to calculate its type I error rates for testing interaction between two genes with multiple phenotypes and to compare its power with multivariate pair-wise interaction analysis and single trait interaction analysis by a single variate functional regression model. To further evaluate its performance, the MFRG for epistasis analysis is applied to five phenotypes and exome sequence data from the NHLBI's Exome Sequencing Project (ESP) to detect pleiotropic epistasis. A total of 136 pairs of genes that formed a genetic interaction network showed significant evidence of epistasis influencing five traits. The results demonstrate that the joint interaction analysis of multiple phenotypes has much higher power to detect interaction than the interaction analysis of single trait and may open a new direction to fully uncovering the genetic structure of multiple phenotypes. Author Summary Most genetic analyses of complex traits have focused on a single trait association analysis, analyzing each phenotype independently, and additive model in which genetic variation is assumed to contribute independently, additively and cumulatively to the trait. However, multiple phenotypes are correlated. Complex traits are influenced by many genetic and environmental factors and their interactions. Joint gene-gene (GxG) interaction analysis of multiple complementary traits will increase statistical power to identify GxG interactions, and hold the key to understanding the complicated genetic structure of the complex diseases. Despite their importance in uncovering the genetic structure of complex traits, the statistical methods for identifying GxG interactions in multiple phenotypes remains less developed owing to its potential complexity. Therefore, we propose to develop a multiple functional regression (MFRG) model in which the genotype functions (genetic variant profiles) are defined as a function of the genomic position of the genetic variants for simultaneous GxG interaction …
منابع مشابه
Functional Regression Models for Epistasis Analysis of Multiple Quantitative Traits
To date, most genetic analyses of phenotypes have focused on analyzing single traits or analyzing each phenotype independently. However, joint epistasis analysis of multiple complementary traits will increase statistical power and improve our understanding of the complicated genetic structure of the complex diseases. Despite their importance in uncovering the genetic structure of complex traits...
متن کاملComparison of hybrid regression and multivariate regression in the regional flood frequency analysis: A case study in Khorasan Razavi province
Background: Magnitude, rate and frequency of the stochastic and unexpected events are of great significance and importance in hydrology. Nowadays, for economic planning of the projects, the use of analytical methods of unexpected events in hydrology is unavoidable. The aim of this study was to compare hybrid regression and multivariate regression to estimate flood peak discharge in the province...
متن کاملInvestigating the accuracy of multivariate regression and ARIMA models in predicting water demand (Case Study: Mashhad city)
Awareness of water demand is of particular importance for its policy in urban management. Predicting water demand in the future will allow managers to take the necessary measures regarding sustainable water supply, given the constraints and crises ahead. The purpose of this study is to compare multivariate regression and ARIMA models to predict water demand in Mashhad. In this study, first, the...
متن کاملGENETIC PROGRAMMING AND MULTIVARIATE ADAPTIVE REGRESION SPLINES FOR PRIDICTION OF BRIDGE RISKS AND COMPARISION OF PERFORMANCES
In this paper, two different data driven models, genetic programming (GP) and multivariate adoptive regression splines (MARS), have been adopted to create the models for prediction of bridge risk score. Input parameters of bridge risks consists of safe risk rating (SRR), functional risk rating (FRR), sustainability risk rating (SUR), environmental risk rating (ERR) and target output. The total ...
متن کاملThe Analysis of Bayesian Probit Regression of Binary and Polychotomous Response Data
The goal of this study is to introduce a statistical method regarding the analysis of specific latent data for regression analysis of the discrete data and to build a relation between a probit regression model (related to the discrete response) and normal linear regression model (related to the latent data of continuous response). This method provides precise inferences on binary and multinomia...
متن کامل